Overview

Brought to you by YData

Dataset statistics

Number of variables10
Number of observations3276
Missing cells1434
Missing cells (%)4.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory256.1 KiB
Average record size in memory80.0 B

Variable types

Numeric9
Categorical1

Alerts

ph has 491 (15.0%) missing valuesMissing
Sulfate has 781 (23.8%) missing valuesMissing
Trihalomethanes has 162 (4.9%) missing valuesMissing
Hardness has unique valuesUnique
Solids has unique valuesUnique
Chloramines has unique valuesUnique
Conductivity has unique valuesUnique
Organic_carbon has unique valuesUnique
Turbidity has unique valuesUnique

Reproduction

Analysis started2025-10-23 14:37:47.958225
Analysis finished2025-10-23 14:38:01.466500
Duration13.51 seconds
Software versionydata-profiling vv4.17.0
Download configurationconfig.json

Variables

ph
Real number (ℝ)

Missing 

Distinct2785
Distinct (%)100.0%
Missing491
Missing (%)15.0%
Infinite0
Infinite (%)0.0%
Mean7.0807945
Minimum0
Maximum14
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size25.7 KiB
2025-10-23T09:38:01.630569image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile4.4879707
Q16.0930919
median7.0367521
Q38.0620661
95-th percentile9.7898186
Maximum14
Range14
Interquartile range (IQR)1.9689742

Descriptive statistics

Standard deviation1.5943195
Coefficient of variation (CV)0.22516111
Kurtosis0.72031558
Mean7.0807945
Median Absolute Deviation (MAD)0.984117
Skewness0.025630448
Sum19720.013
Variance2.5418547
MonotonicityNot monotonic
2025-10-23T09:38:01.829747image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5.1894136691
 
< 0.1%
6.5380840871
 
< 0.1%
5.915806751
 
< 0.1%
8.1364978691
 
< 0.1%
6.4937641751
 
< 0.1%
6.9774056331
 
< 0.1%
5.4892480551
 
< 0.1%
2.5581027991
 
< 0.1%
7.3121093041
 
< 0.1%
8.554096971
 
< 0.1%
Other values (2775)2775
84.7%
(Missing)491
 
15.0%
ValueCountFrequency (%)
01
< 0.1%
0.22749905021
< 0.1%
0.97557798981
< 0.1%
0.98991221291
< 0.1%
1.4317815551
< 0.1%
1.7570371151
< 0.1%
1.8445383661
< 0.1%
1.9853833591
< 0.1%
2.1285314341
< 0.1%
2.3767680761
< 0.1%
ValueCountFrequency (%)
141
< 0.1%
13.541240241
< 0.1%
13.349888561
< 0.1%
13.175401721
< 0.1%
12.246928071
< 0.1%
11.907739831
< 0.1%
11.898078031
< 0.1%
11.621140131
< 0.1%
11.568767971
< 0.1%
11.563169061
< 0.1%

Hardness
Real number (ℝ)

Unique 

Distinct3276
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean196.3695
Minimum47.432
Maximum323.124
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.7 KiB
2025-10-23T09:38:02.010645image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum47.432
5-th percentile141.76328
Q1176.85054
median196.96763
Q3216.66746
95-th percentile249.60977
Maximum323.124
Range275.692
Interquartile range (IQR)39.816918

Descriptive statistics

Standard deviation32.879761
Coefficient of variation (CV)0.16743823
Kurtosis0.61577168
Mean196.3695
Median Absolute Deviation (MAD)19.844989
Skewness-0.039341705
Sum643306.47
Variance1081.0787
MonotonicityNot monotonic
2025-10-23T09:38:02.183243image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
195.10229861
 
< 0.1%
204.89045551
 
< 0.1%
129.42292051
 
< 0.1%
224.23625941
 
< 0.1%
214.37339411
 
< 0.1%
181.10150921
 
< 0.1%
188.31332381
 
< 0.1%
248.07173531
 
< 0.1%
203.36152261
 
< 0.1%
118.98857911
 
< 0.1%
Other values (3266)3266
99.7%
ValueCountFrequency (%)
47.4321
< 0.1%
73.492233691
< 0.1%
77.45958611
< 0.1%
81.710895271
< 0.1%
94.091307481
< 0.1%
94.812545221
< 0.1%
94.908977131
< 0.1%
97.28090861
< 0.1%
98.36791491
< 0.1%
98.452930511
< 0.1%
ValueCountFrequency (%)
323.1241
< 0.1%
317.33812411
< 0.1%
311.38395651
< 0.1%
308.25383291
< 0.1%
307.70602411
< 0.1%
306.62748141
< 0.1%
304.23591211
< 0.1%
303.70262671
< 0.1%
300.29247581
< 0.1%
298.09867951
< 0.1%

Solids
Real number (ℝ)

Unique 

Distinct3276
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22014.093
Minimum320.94261
Maximum61227.196
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.7 KiB
2025-10-23T09:38:02.355345image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum320.94261
5-th percentile9545.8126
Q115666.69
median20927.834
Q327332.762
95-th percentile38474.99
Maximum61227.196
Range60906.253
Interquartile range (IQR)11666.072

Descriptive statistics

Standard deviation8768.5708
Coefficient of variation (CV)0.39831625
Kurtosis0.44282609
Mean22014.093
Median Absolute Deviation (MAD)5809.4719
Skewness0.62163449
Sum72118167
Variance76887834
MonotonicityNot monotonic
2025-10-23T09:38:02.528838image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
17404.177061
 
< 0.1%
20791.318981
 
< 0.1%
18630.057861
 
< 0.1%
19909.541731
 
< 0.1%
22018.417441
 
< 0.1%
17978.986341
 
< 0.1%
28748.687741
 
< 0.1%
28749.716541
 
< 0.1%
13672.091761
 
< 0.1%
14285.583851
 
< 0.1%
Other values (3266)3266
99.7%
ValueCountFrequency (%)
320.94261131
< 0.1%
728.75082961
< 0.1%
1198.9436991
< 0.1%
1351.9069791
< 0.1%
1372.0910431
< 0.1%
2552.9628041
< 0.1%
2808.0257561
< 0.1%
2835.3031651
< 0.1%
2912.2112471
< 0.1%
3413.0816331
< 0.1%
ValueCountFrequency (%)
61227.196011
< 0.1%
56867.859241
< 0.1%
56488.672411
< 0.1%
56351.39631
< 0.1%
56320.586981
< 0.1%
55334.70281
< 0.1%
53735.899191
< 0.1%
52318.91731
< 0.1%
52060.22681
< 0.1%
51731.820551
< 0.1%

Chloramines
Real number (ℝ)

Unique 

Distinct3276
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.1222768
Minimum0.352
Maximum13.127
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.7 KiB
2025-10-23T09:38:02.738715image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0.352
5-th percentile4.5030537
Q16.1274208
median7.130299
Q38.114887
95-th percentile9.7531005
Maximum13.127
Range12.775
Interquartile range (IQR)1.9874663

Descriptive statistics

Standard deviation1.5830849
Coefficient of variation (CV)0.22227231
Kurtosis0.58990117
Mean7.1222768
Median Absolute Deviation (MAD)0.99166134
Skewness-0.01209844
Sum23332.579
Variance2.5061578
MonotonicityNot monotonic
2025-10-23T09:38:02.946615image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7.5093058571
 
< 0.1%
7.3002118731
 
< 0.1%
6.6352458841
 
< 0.1%
9.2758836031
 
< 0.1%
8.0593323771
 
< 0.1%
6.5465999741
 
< 0.1%
7.5448687891
 
< 0.1%
7.5134084661
 
< 0.1%
4.5630086861
 
< 0.1%
7.8041735531
 
< 0.1%
Other values (3266)3266
99.7%
ValueCountFrequency (%)
0.3521
< 0.1%
0.53035129471
< 0.1%
1.3908709051
< 0.1%
1.6839925811
< 0.1%
1.9202714491
< 0.1%
2.1026909911
< 0.1%
2.3866534941
< 0.1%
2.397984991
< 0.1%
2.4560135961
< 0.1%
2.4586091951
< 0.1%
ValueCountFrequency (%)
13.1271
< 0.1%
13.043806111
< 0.1%
12.912186641
< 0.1%
12.653362021
< 0.1%
12.626899741
< 0.1%
12.580026491
< 0.1%
12.363284831
< 0.1%
12.279374181
< 0.1%
12.24639411
< 0.1%
12.227175281
< 0.1%

Sulfate
Real number (ℝ)

Missing 

Distinct2495
Distinct (%)100.0%
Missing781
Missing (%)23.8%
Infinite0
Infinite (%)0.0%
Mean333.77578
Minimum129
Maximum481.03064
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.7 KiB
2025-10-23T09:38:03.124043image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum129
5-th percentile266.61623
Q1307.6995
median333.07355
Q3359.95017
95-th percentile403.07019
Maximum481.03064
Range352.03064
Interquartile range (IQR)52.250673

Descriptive statistics

Standard deviation41.41684
Coefficient of variation (CV)0.12408582
Kurtosis0.64826281
Mean333.77578
Median Absolute Deviation (MAD)26.095176
Skewness-0.035946622
Sum832770.56
Variance1715.3547
MonotonicityNot monotonic
2025-10-23T09:38:03.304676image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
280.74562291
 
< 0.1%
332.74451921
 
< 0.1%
391.91822861
 
< 0.1%
330.90537041
 
< 0.1%
402.31342711
 
< 0.1%
360.69781511
 
< 0.1%
336.04045181
 
< 0.1%
405.52733721
 
< 0.1%
366.21442791
 
< 0.1%
301.23084821
 
< 0.1%
Other values (2485)2485
75.9%
(Missing)781
 
23.8%
ValueCountFrequency (%)
1291
< 0.1%
180.20674641
< 0.1%
182.39737021
< 0.1%
187.17071441
< 0.1%
187.42413091
< 0.1%
192.03359171
< 0.1%
203.44452081
< 0.1%
205.93509061
< 0.1%
206.24722941
< 0.1%
207.89048231
< 0.1%
ValueCountFrequency (%)
481.03064231
< 0.1%
476.53971731
< 0.1%
475.73746021
< 0.1%
462.4742151
< 0.1%
460.1070691
< 0.1%
458.44107231
< 0.1%
455.45123371
< 0.1%
450.91445441
< 0.1%
449.26768751
< 0.1%
447.41796241
< 0.1%

Conductivity
Real number (ℝ)

Unique 

Distinct3276
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean426.20511
Minimum181.48375
Maximum753.34262
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.7 KiB
2025-10-23T09:38:03.498500image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum181.48375
5-th percentile300.10947
Q1365.73441
median421.88497
Q3481.7923
95-th percentile566.34932
Maximum753.34262
Range571.85887
Interquartile range (IQR)116.05789

Descriptive statistics

Standard deviation80.824064
Coefficient of variation (CV)0.18963654
Kurtosis-0.27709283
Mean426.20511
Median Absolute Deviation (MAD)57.887591
Skewness0.26449022
Sum1396247.9
Variance6532.5293
MonotonicityNot monotonic
2025-10-23T09:38:03.701501image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
327.45976051
 
< 0.1%
564.30865421
 
< 0.1%
592.88535911
 
< 0.1%
418.60621311
 
< 0.1%
363.26651621
 
< 0.1%
398.41081341
 
< 0.1%
280.46791591
 
< 0.1%
283.65163351
 
< 0.1%
474.60764491
 
< 0.1%
389.37556591
 
< 0.1%
Other values (3266)3266
99.7%
ValueCountFrequency (%)
181.4837541
< 0.1%
201.61973681
< 0.1%
210.3191821
< 0.1%
217.35832961
< 0.1%
232.6136241
< 0.1%
233.90796511
< 0.1%
235.04228351
< 0.1%
245.8596321
< 0.1%
247.91803051
< 0.1%
251.02089871
< 0.1%
ValueCountFrequency (%)
753.34261961
< 0.1%
708.22636451
< 0.1%
695.3695281
< 0.1%
674.44347591
< 0.1%
672.55699921
< 0.1%
669.72508621
< 0.1%
666.69061831
< 0.1%
660.25494631
< 0.1%
657.57042181
< 0.1%
656.92412781
< 0.1%

Organic_carbon
Real number (ℝ)

Unique 

Distinct3276
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.28497
Minimum2.2
Maximum28.3
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.7 KiB
2025-10-23T09:38:03.897775image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum2.2
5-th percentile8.8153617
Q112.065801
median14.218338
Q316.557652
95-th percentile19.637254
Maximum28.3
Range26.1
Interquartile range (IQR)4.4918502

Descriptive statistics

Standard deviation3.308162
Coefficient of variation (CV)0.2315834
Kurtosis0.044409307
Mean14.28497
Median Absolute Deviation (MAD)2.2322941
Skewness0.025532582
Sum46797.563
Variance10.943936
MonotonicityNot monotonic
2025-10-23T09:38:04.077118image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
16.140367631
 
< 0.1%
10.379783081
 
< 0.1%
15.180013121
 
< 0.1%
16.868636931
 
< 0.1%
18.43652451
 
< 0.1%
11.558279441
 
< 0.1%
8.399734641
 
< 0.1%
13.789695321
 
< 0.1%
12.36381671
 
< 0.1%
12.706048971
 
< 0.1%
Other values (3266)3266
99.7%
ValueCountFrequency (%)
2.21
< 0.1%
4.3718986081
< 0.1%
4.4667719691
< 0.1%
4.4730922641
< 0.1%
4.8616314981
< 0.1%
4.9028880681
< 0.1%
4.9668616191
< 0.1%
5.0516946151
< 0.1%
5.1593803081
< 0.1%
5.1884664551
< 0.1%
ValueCountFrequency (%)
28.31
< 0.1%
27.006706611
< 0.1%
24.755392371
< 0.1%
23.952450441
< 0.1%
23.917601261
< 0.1%
23.667666781
< 0.1%
23.604297971
< 0.1%
23.569644911
< 0.1%
23.514773771
< 0.1%
23.399516061
< 0.1%

Trihalomethanes
Real number (ℝ)

Missing 

Distinct3114
Distinct (%)100.0%
Missing162
Missing (%)4.9%
Infinite0
Infinite (%)0.0%
Mean66.396293
Minimum0.738
Maximum124
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.7 KiB
2025-10-23T09:38:04.266672image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum0.738
5-th percentile39.552928
Q155.844536
median66.622485
Q377.337473
95-th percentile92.124059
Maximum124
Range123.262
Interquartile range (IQR)21.492937

Descriptive statistics

Standard deviation16.175008
Coefficient of variation (CV)0.24361313
Kurtosis0.23859744
Mean66.396293
Median Absolute Deviation (MAD)10.742172
Skewness-0.083030674
Sum206758.06
Variance261.6309
MonotonicityNot monotonic
2025-10-23T09:38:04.458691image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
56.715509551
 
< 0.1%
54.473934621
 
< 0.1%
77.730814371
 
< 0.1%
90.394894721
 
< 0.1%
37.787096641
 
< 0.1%
78.92552711
 
< 0.1%
89.477718371
 
< 0.1%
69.5267181
 
< 0.1%
72.573959381
 
< 0.1%
57.780869321
 
< 0.1%
Other values (3104)3104
94.7%
(Missing)162
 
4.9%
ValueCountFrequency (%)
0.7381
< 0.1%
8.1758763841
< 0.1%
8.5770129331
< 0.1%
14.343161451
< 0.1%
15.68487681
< 0.1%
16.29150461
< 0.1%
17.000682931
< 0.1%
17.527764961
< 0.1%
17.915722571
< 0.1%
18.015272361
< 0.1%
ValueCountFrequency (%)
1241
< 0.1%
120.0300771
< 0.1%
118.35727471
< 0.1%
116.16162161
< 0.1%
114.20867141
< 0.1%
114.03494571
< 0.1%
113.04888571
< 0.1%
112.6227331
< 0.1%
112.41221041
< 0.1%
112.06102741
< 0.1%

Turbidity
Real number (ℝ)

Unique 

Distinct3276
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.9667862
Minimum1.45
Maximum6.739
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.7 KiB
2025-10-23T09:38:04.637746image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1.45
5-th percentile2.6842792
Q13.4397109
median3.9550276
Q34.5003198
95-th percentile5.2209245
Maximum6.739
Range5.289
Interquartile range (IQR)1.0606089

Descriptive statistics

Standard deviation0.78038241
Coefficient of variation (CV)0.19672913
Kurtosis-0.062800641
Mean3.9667862
Median Absolute Deviation (MAD)0.53029624
Skewness-0.0078166424
Sum12995.191
Variance0.6089967
MonotonicityNot monotonic
2025-10-23T09:38:04.826759image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2.3091490571
 
< 0.1%
2.9631353811
 
< 0.1%
4.5006562751
 
< 0.1%
3.055933751
 
< 0.1%
4.6287705371
 
< 0.1%
4.0750754251
 
< 0.1%
2.5597082281
 
< 0.1%
2.6729887371
 
< 0.1%
4.4014247151
 
< 0.1%
3.5950171811
 
< 0.1%
Other values (3266)3266
99.7%
ValueCountFrequency (%)
1.451
< 0.1%
1.4922066151
< 0.1%
1.4961009431
< 0.1%
1.641515011
< 0.1%
1.6597993851
< 0.1%
1.6805540251
< 0.1%
1.6876245051
< 0.1%
1.8013269991
< 0.1%
1.812528941
< 0.1%
1.8443716041
< 0.1%
ValueCountFrequency (%)
6.7391
< 0.1%
6.4947485561
< 0.1%
6.4942494671
< 0.1%
6.3891610091
< 0.1%
6.357438521
< 0.1%
6.3076784721
< 0.1%
6.2265804051
< 0.1%
6.2048463591
< 0.1%
6.0996318731
< 0.1%
6.0837723541
< 0.1%

Potability
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size25.7 KiB
0
1998 
1
1278 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters3276
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
01998
61.0%
11278
39.0%

Length

2025-10-23T09:38:05.001278image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2025-10-23T09:38:05.102300image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ValueCountFrequency (%)
01998
61.0%
11278
39.0%

Most occurring characters

ValueCountFrequency (%)
01998
61.0%
11278
39.0%

Most occurring categories

ValueCountFrequency (%)
(unknown)3276
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
01998
61.0%
11278
39.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown)3276
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
01998
61.0%
11278
39.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown)3276
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
01998
61.0%
11278
39.0%

Interactions

2025-10-23T09:37:59.519969image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:48.541736image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:49.928344image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:51.234904image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:52.581361image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:53.954203image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:55.279308image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:56.623509image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:57.911704image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:59.790525image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:48.712180image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:50.071823image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:51.384421image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:52.742063image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:54.116044image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:55.429141image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:56.766954image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:58.079813image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:59.950800image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:48.862547image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:50.210163image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:51.527062image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:52.890523image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:54.257301image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:55.573029image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:56.907204image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:58.223932image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:38:00.093475image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:49.014249image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:50.353661image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:51.682545image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:53.052616image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:54.414126image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:55.725456image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:57.049780image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:58.374336image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:38:00.238656image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:49.160501image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:50.506734image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:51.827888image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:53.195231image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:54.560863image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:55.877260image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:57.217329image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:58.540534image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:38:00.376229image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:49.323551image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:50.650964image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:51.973195image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:53.338164image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:54.702275image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:56.025041image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:57.356510image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:58.688820image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:38:00.512592image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:49.469590image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:50.801600image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:52.117642image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:53.492337image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:54.844360image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:56.170779image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:57.496254image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:58.850015image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:38:00.645301image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:49.612850image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:50.937099image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:52.259525image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:53.645198image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:54.980403image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:56.328315image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:57.629850image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:59.002298image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:38:00.785020image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:49.763589image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:51.092231image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:52.423114image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:53.794426image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:55.123488image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:56.480813image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:57.771577image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-10-23T09:37:59.374495image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Correlations

2025-10-23T09:38:05.195821image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
ChloraminesConductivityHardnessOrganic_carbonPotabilitySolidsSulfateTrihalomethanesTurbidityph
Chloramines1.000-0.017-0.025-0.0120.077-0.0550.0370.018-0.008-0.042
Conductivity-0.0171.000-0.0330.0210.0000.021-0.022-0.0040.0100.017
Hardness-0.025-0.0331.0000.0030.079-0.053-0.095-0.012-0.0130.116
Organic_carbon-0.0120.0210.0031.0000.0150.0180.020-0.008-0.0250.044
Potability0.0770.0000.0790.0151.0000.0250.1510.0000.0000.084
Solids-0.0550.021-0.0530.0180.0251.000-0.154-0.0200.028-0.075
Sulfate0.037-0.022-0.0950.0200.151-0.1541.000-0.031-0.0190.024
Trihalomethanes0.018-0.004-0.012-0.0080.000-0.020-0.0311.000-0.0280.005
Turbidity-0.0080.010-0.013-0.0250.0000.028-0.019-0.0281.000-0.049
ph-0.0420.0170.1160.0440.084-0.0750.0240.005-0.0491.000

Missing values

2025-10-23T09:38:01.014726image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
A simple visualization of nullity by column.
2025-10-23T09:38:01.174888image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2025-10-23T09:38:01.376712image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

phHardnessSolidsChloraminesSulfateConductivityOrganic_carbonTrihalomethanesTurbidityPotability
0NaN204.89045520791.3189817.300212368.516441564.30865410.37978386.9909702.9631350
13.716080129.42292118630.0578586.635246NaN592.88535915.18001356.3290764.5006560
28.099124224.23625919909.5417329.275884NaN418.60621316.86863766.4200933.0559340
38.316766214.37339422018.4174418.059332356.886136363.26651618.436524100.3416744.6287710
49.092223181.10150917978.9863396.546600310.135738398.41081311.55827931.9979934.0750750
55.584087188.31332428748.6877397.544869326.678363280.4679168.39973554.9178622.5597080
610.223862248.07173528749.7165447.513408393.663396283.65163413.78969584.6035562.6729890
78.635849203.36152313672.0917644.563009303.309771474.60764512.36381762.7983094.4014250
8NaN118.98857914285.5838547.804174268.646941389.37556612.70604953.9288463.5950170
911.180284227.23146925484.5084919.077200404.041635563.88548117.92780671.9766014.3705620
phHardnessSolidsChloraminesSulfateConductivityOrganic_carbonTrihalomethanesTurbidityPotability
32668.372910169.08705214622.7454947.547984NaN464.52555211.08302738.4351514.9063581
32678.989900215.04735815921.4120186.297312312.931022390.4102319.89911555.0693044.6138431
32686.702547207.32108617246.9203477.708117304.510230329.26600216.21730328.8786013.4429831
326911.49101194.81254537188.8260229.263166258.930600439.89361816.17275541.5585014.3692641
32706.069616186.65904026138.7801917.747547345.700257415.88695512.06762060.4199213.6697121
32714.668102193.68173547580.9916037.166639359.948574526.42417113.89441966.6876954.4358211
32727.808856193.55321217329.8021608.061362NaN392.44958019.903225NaN2.7982431
32739.419510175.76264633155.5782187.350233NaN432.04478311.03907069.8454003.2988751
32745.126763230.60375811983.8693766.303357NaN402.88311311.16894677.4882134.7086581
32757.874671195.10229917404.1770617.509306NaN327.45976016.14036878.6984462.3091491